Study of the influence of noise pre-processing on the performance of a low bit rate parametric speech coder

نویسندگان

  • Gwénaël Guilmin
  • Régine Le Bouquin-Jeannès
  • Philippe Gournay
چکیده

This paper describes a prospective study of the contribution of a single-sensor noise pre-processing method, prior to coding, to the performance of a parametric low bit rate speech coder in adverse conditions. The 2.4kbits/s vocoder we use estimates four parameters: fundamental frequency, voicing, linear prediction coefficients and energy. Firstly, we study the influence of different noise levels on the estimated parameters with and without noise reduction system. Secondly, we measure the contribution of (i) each speech coder parameter and (ii) the speech enhancement system to the global output intelligibility. Finally, results show the interest of such a speech enhancement system for low bit rate parameter estimation and underline the interest to adapt different pre-processing techniques for each parameter estimation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Planelet Transform: A New Geometrical Wavelet for Compression of Kinect-like Depth Images

With the advent of cheap indoor RGB-D sensors, proper representation of piecewise planar depth images is crucial toward an effective compression method. Although there exist geometrical wavelets for optimal representation of piecewise constant and piecewise linear images (i.e. wedgelets and platelets), an adaptation to piecewise linear fractional functions which correspond to depth variation ov...

متن کامل

Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec - Vision, Image and Signal Processing, IEE Proceedings-

This paper presents several strategies to improve the performance of very low bit rate speech coders and describes a speech codec that incorporates these strategies and operates at an average bit rate of 1.2 kb/s. The encoding algorithm is based on several improvements in a mixed multiband excitation (MMBE) linear predictive coding (LPC) structure. A switched-predictive vector quantiser techniq...

متن کامل

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement

A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...

متن کامل

Robustness of the Mbe Vocoder to Acoustic Background Noise

I.I.T. Bombay, in collaboration with C.R.L., B.E.L., is working on developing a speech codec to provide communication quality speech at low bit rates. The codec is required to be robust to acoustic background noise and to channel errors. A sinusoidal coder based on the MBE (Multiband Excitation) model has been adopted as the basic speech representation due to its compact parameter set and the r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999